PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cotton_A_15254_BGI-A2_v1.0
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family bHLH
Protein Properties Length: 525aa    MW: 59147.2 Da    PI: 5.2507
Description bHLH family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cotton_A_15254_BGI-A2_v1.0genomeBGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1HLH16.31.8e-05341365730
                                 HHHHHHHHHHHHHHHHHHCTSCC.C CS
                         HLH   7 erErrRRdriNsafeeLrellPk.a 30 
                                  +Er+RR+++N+++  Lr+l P+  
  Cotton_A_15254_BGI-A2_v1.0 341 VAERKRRKKLNERLYSLRSLGPNgF 365
                                 68*****************988843 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF142157.2E-217153IPR025610Transcription factor MYC/MYB N-terminal
Gene3DG3DSA:4.10.280.104.4E-4341364IPR011598Myc-type, basic helix-loop-helix (bHLH) domain
CDDcd048730.00143407470No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0009555Biological Processpollen development
GO:0048658Biological Processanther wall tapetum development
GO:0005634Cellular Componentnucleus
GO:0000978Molecular FunctionRNA polymerase II core promoter proximal region sequence-specific DNA binding
GO:0046983Molecular Functionprotein dimerization activity
Sequence ? help Back to Top
Protein Sequence    Length: 525 aa     Download sequence    Send to blast
MNIFQNLMER LRQVVGPKGW DYCVLWKLSD DQRFLEWVDC CCGGAESIES SGELQFPVTT  60
VLPCRDVMFQ HPKTRSCELL AQLPSCMPLD SGSHAQALIS NQPKWFNFSN NSDPNVLEEI  120
VGTRILIPVA EGLIELFVAK QVCEDQNVMD YIVTLCNISL EQSSMMNSSC MDTHFTALNA  180
QALNEFQAKT HLSNENDRKD PIINHFQPPL TTTLETLNLP YDISIDQIRS TNTLQQYHYL  240
SDDKNRKNMD VCVEGSHEVF LSDKVVNPLK SSVDNGLQEI DPLNSMVTNE SMVIQGNEKD  300
SIKQENGRSD SISDCSDQND DEDDARYQRR AGSKGQSKNL VAERKRRKKL NERLYSLRSL  360
GPNGFPVGGN GSVSRAQNQE VDTCADKTQQ MEVQVEVAQI DGNQFFVKVF SEHKPGGFVR  420
LMEALDSLGL EVTNANVNSF RGLVSNVFKV EKKDSEMVQA DHVRESLLEL TRTPSKGLSE  480
MVKASETNNG NGVECNYHNQ QQHLHNQRIT SHHHQLHHFP PKQAA
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1342349ERKRRKKL
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankJX6228414e-96JX622841.1 Gossypium hirsutum clone NBRI_GE69754 microsatellite sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_012436385.10.0PREDICTED: transcription factor ABORTED MICROSPORES isoform X1
RefseqXP_012436386.10.0PREDICTED: transcription factor ABORTED MICROSPORES isoform X2
TrEMBLA0A0D2R8Q70.0A0A0D2R8Q7_G
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM75892741
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G16910.11e-110bHLH family protein